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3 <110> APPLICANT: MATSUMOTO, KAZUYA 

4 KAZUNO, YASUSHI 

5 HIGASHIMURA, NORIKAZU 

6 OHSHIMA, TOSHIHISA 

7 • SAKURABA, HARUHIKO 

9 <120> TITLE OF INVENTION: PROCESS FOR PRODUCING CHIRAL HYDROXYALDEHYDE COMPOUNDS 







<130> 


FILE REFERENCE: 1034232-000045 
















c- - > 




<140> 


CURRENT APPLICATION NUMBER: US/10/594,731 














1 4 


<141> 


CURRENT PILING DATE: 2006 


-09- 


29 


















± o 


<150> 


PRIOR 


APPLICATION 


NUMBER: 


PCT/JP2005/005719 














<151> 


PRIOR 


FILING DATE 


: 2005-03-28 




















1 Q 


<150> 


PRIOR 


APPLICATION 


NUMBER: 


JP 


2004- 


-095263 














0 0 
^ \j 


<151> 


PRIOR 


FILING DATE 


: 2004-03-29 




















22 


<160> 


NUMBER OF 


SEQ 


ID NOS: 


10 






















24 


<170> 


SOFTWARE : 


Patent In Ver. 3 


.3 




















26 


<210> 


SEQ ID NO 


: 1 




























27 


<211> 


LENGTH: 747 




























28 


<212> 


TYPE: 


DNA 






























29 


<213> 


ORGANISM: 


Thermotoga maritima 




















31 


<220> 


FEATURE: 






























32 


<221> 


NAME /KEY: 


CDS 




























33 


<222> 


LOCATION: 


(1) 


. . (744) 
























35 


<400> 


SEQUENCE: 


1 




























36 


atg 


ata 


gag 


tac 


agg 


att 


gag 


gag 


gca 


gta 


gcg 


aag 


tac 


aga 


gag 


ttc 


48 




37 


Met 


He 


Glu 


Tyr 


Arg 


He 


Glu 


Glu 


Ala 


Val 


Ala 


Lys 


Tyr 


Arg 


Glu 


Phe 






38 


1 










5 










10 










15 








40 


tac 


gaa 


ttc 


aag 


cec 


gtc 


aga 


gaa 


age 


gca 


ggt 


att 


gaa 


gat 


gtg 


aaa 


96 




41 


Tyr 


Glu 


Phe 


Lys 


Pro 


Val 


Arg 


Glu 


Ser 


Ala 


Gly 


He 


Glu 


Asp 


Val 


Lys 






42 










20 










25 










30 










44 


agt 


get 


ata 


gag 


cae 


aeg 


aat 


ctg 


aaa 


ccg 


ttt 


gee 


aea 


cea 


gae 


gat 


144 




45 


Ser 


Ala 


He 


Glu 


His 


Thr 


Asn 


Leu 


Lys 


Pro 


Phe 


Ala 


Thr 


Pro 


Asp 


Asp 






46 








35 










40 










45 












48 


ata 


aaa 


aaa 


etc 


tgt 


ctt 


gaa 


gca 


agg 


gaa 


aat 


cgt 


ttc 


eat 


gga 


gtc 


192 




49 


He 


Lys 


Lys 


Leu 


Cys 


Leu 


Glu 


Ala 


Arg 


Glu 


Asn 


Arg 


Phe 


His 


Gly 


Val 






50 




50 










55 










60 














52 


tgt 


gtg 


aat 


ccg 


tgt 


tat 


gtg 


aaa 


ctg 


get 


cgt 


gaa 


gaa 


etc 


gaa 


gga 


240 




53 


Cys 


Val 


Asn 


Pro 


Cys 


Tyr 


Val 


Lys 


Leu 


Ala 


Arg 


Glu 


Glu 


Leu 


Glu 


Gly 






54 


65 












70 










75 










80 






56 


acc 


gat 


gtg 


aaa 


gtc 


gtc 


acc 


gtt 


gtt 


ggt 


ttt 


cea 


etg 


gga 


gcg 


aae 


288 




57 


Thr 


Asp 


Val 


Lys 


Val 


Val 


Thr 


Val 


Val 


Gly 


Phe 


Pro 


Leu 


Gly 


Ala 


Asn 






58 












85 










90 










95 








60 


gaa 


act 


egg 


aeg 


aaa 


gee 


cat 


gag 


gcg 


att 


ttc 


get 


gtt 


gag 


agt 


gga 


336 




61 


Glu 


Thr 


Arg 


Thr 


Lys 


Ala 


His 


Glu 


Ala 


He 


Phe 


Ala 


Val 


Glu 


Ser 


Gly 
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62 






100 










105 










110 








64 


gee 


gat gag 


ate 


gat 


atg 


gte 


ate 


aac 


gtt 


ggc 


atg 


etc 


aag 


gca 


aag 


384 


65 


Ala 


Asp Glu 


He 


Asp 


Met 


Val 


He 


Asn 


Val 


Gly 


Met 


Leu 


Lys 


Ala 


Lys 




66 




115 










120 










125 










68 


gag 


tgg gag 


tac 


gtt 


tac 


gag 


gat 


ata 


aga 


agt 


gtt 


gte 


gaa 


teg 


gtg 


432 


69 


Glu 


Trp Glu Tyr 


Val 


Tyr 


Glu 


Asp 


He 


Arg 


Ser 


Val 


Val 


Glu 


Ser 


Val 




70 




130 








135 










140 












72 


aaa 


gga aaa 


gtt 


gtg 


aag 


gtg 


ate 


ate 


gaa 


acg 


tge 


tat 


ctg 


gat 


aeg 


480 


73 


Lys 


Gly Lys 


Val 


Val 


Lys 


Val 


He 


He 


Glu 


Thr 


Cys 


Tyr 


Leu 


Asp 


Thr 




74 


145 








150 










155 










160 




76 


gaa 


gag aag 


ata 


gcg 


geg 


tgt 


gte 


att 


tee 


aaa 


ett 


get 


gga 


get 


eat 


528 


77 


Glu 


Glu Lys 


He 


Ala 


Ala 


Cys 


Val 


He 


Ser 


Lys 


Leu 


Ala 


Gly 


Ala 


His 




78 








165 










170 










175 






80 


ttc 


gtg aag 


act 


tec 


acg 


gga 


ttt 


gga 


aca 


gga 


ggg 


gcg 


acc 


gca 


gaa 


576 


81 


Phe 


Val Lys 


Thr 


Ser 


Thr 


Gly 


Phe 


Gly 


Thr 


Gly 


Gly 


Ala 


Thr 


Ala 


Glu 




82 






180 










185 










190 








84 


gac 


gtt cat 


etc 


atg 


aaa 


tgg 


ate 


gtg 


gga 


gat 


gag 


atg 


ggt 


gta 


aaa 


624 


85 


Asp 


Val His 


Leu 


Met 


Lys 


Trp 


He 


Val 


Gly 


Asp 


Glu 


Met 


Gly 


Val 


Lys 




86 




195 










200 










205 










88 


get 


tec gga 


ggg 


ate 


aga 


ace 


ttc 


gag 


gac 


get 


gtt 


aaa 


atg 


ate 


atg 


672 


89 


Ala 


Ser Gly Gly 


He 


Arg 


Thr 


Phe 


Glu 


Asp 


Ala 


Val 


Lys 


Met 


He 


Met 




90 




210 








215 










220 












92 


tac 


ggt get 


gat 


aga 


ata 


gga 


acg 


agt 


teg 


gga 


gtt 


aag 


ate 


gtt 


eag 


720 


93 


Tyr 


Gly Ala Asp 


Arg 


He 


Gly 


Thr 


Ser 


Ser 


Gly 


Val 


Lys 


He 


Val 


Gin 




94 


225 








230 










235 










240 




96 


ggg 


gga gaa 


gag 


aga 


tat 


gga 


ggt 


tga 
















747 


97 


Gly 


Gly Glu Glu 


Arg 


Tyr 


Gly 


Gly 




















98 








245 



























101 <210> SEQ ID NO: 2 

102 <211> LENGTH: 248 

103 <212> TYPE: PRT 

104 <213> ORGANISM: Thermotoga maritima 
106 <400> SEQUENCE: 2 



107 


Met 


He 


Glu 


Tyr 


Arg 


He 


Glu 


Glu 


Ala 


Val 


Ala 


Lys 


Tyr 


Arg 


Glu 


Phe 


108 


1 








5 










10 










15 




110 


Tyr 


Glu 


Phe 


Lys 


Pro 


Val 


Arg 


Glu 


Ser 


Ala 


Gly 


He 


Glu 


Asp 


Val 


Lys 


111 








20 










25 










30 






113 


Ser 


Ala 


He 


Glu 


His 


Thr 


Asn 


Leu 


Lys 


Pro 


Phe 


Ala 


Thr 


Pro 


Asp 


Asp 


114 






35 










40 










45 








116 


He 


Lys 


Lys 


Leu 


Cys 


Leu 


Glu 


Ala Arg Glu 


Asn 


Arg 


Phe 


His 


Gly 


Val 


117 




50 










55 










60 










119 


Cys 


Val 


Asn 


Pro 


Cys 


Tyr 


Val 


Lys 


Leu 


Ala 


Arg 


Glu 


Glu 


Leu 


Glu 


Gly 


120 


65 










70 










75 










80 


122 


Thr 


Asp 


Val 


Lys 


Val 


Val 


Thr 


Val 


Val 


Gly 


Phe 


Pro 


Leu 


Gly 


Ala 


Asn 


123 










85 










90 










95 




125 


Glu 


Thr 


Arg 


Thr 


Lys 


Ala 


His 


Glu 


Ala 


He 


Phe 


Ala 


Val 


Glu 


Ser 


Gly 


126 








100 










105 










110 






128 


Ala 


Asp 


Glu 


He 


Asp 


Met 


Val 


He 


Asn 


Val 


Gly 


Met 


Leu 


Lys 


Ala 


Lys 


129 






115 










120 










125 
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131 


Glu Trp Glu Tyr 


Val 


Tyr 


Glu 


Asp 


He 


Arg 


Ser 


Val 


Val 


Glu 


Ser 


Val 




132 


130 






135 










140 












134 


Lys Gly Lys Val 


Val 


Lys 


Val 


He 


He 


Glu 


Thr 


Cys 


Tyr 


Leu 


Asp 


Thr 




135 


145 




150 










155 










160 




137 


Glu Glu Lys lie 


Ala 


Ala 


Cys 


Val 


He 


Ser 


Lys 


Leu 


Ala 


Gly 


Ala 


His 




138 




165 










170 










175 






140 


Phe Val Lys Thr 


Ser 


Thr 


Gly 


Phe 


Gly 


Thr 


Gly 


Gly 


Ala 


Thr 


Ala 


Glu 




141 


180 










185 










190 








143 


Asp Val His Leu 


Met 


Lys 


Trp 


He 


Val 


Gly 


Asp 


Glu 


Met 


Gly 


Val 


Lys 




144 


195 








200 










205 










146 


Ala Ser Gly Gly 


He 


Arg 


Thr 


Phe 


Glu 


Asp 


Ala 


Val 


Lys 


Met 


He 


Met 




147 


210 






215 










220 












149 


Tyr Gly Ala Asp 


Arg 


He 


Gly 


Thr 


Ser 


Ser 


Gly 


Val 


Lys 


He 


Val 


Gin 




150 


225 




230 










235 










240 




152 


Gly Gly Glu Glu 


Arg 


Tyr 


Gly 


Gly 




















153 




245 


























157 


<210> SEQ ID NO; 


: 3 


























158 


<211> LENGTH: 681 


























159 


<212> TYPE: DNA 




























160 


<213> ORGANISM: 


Pyrobaculum 


aerophilum 
















162 


<220> FEATURE: 




























163 


<221> NAME/KEY: 


CDS 


























164 


<222> LOCATION: 


(1) 


. . (678) 






















166 


<400> SEQUENCE: 


3 


























167 


atg ata cat tta 


gta 


gac 


tac 


gcg 


ctt 


etc 


aag 


ecg 


tat 


etc 


aca 


gta 


48 


168 


Met lie His Leu 


Val 


Asp 


Tyr 


Ala 


Leu 


Leu 


Lys 


Pro 


Tyr 


Leu 


Thr 


Val 




169 


1 


5 










10 










15 






171 


gat gaa gca gtc 


gcc 


ggg 


get 


cgc 


aag 


gcg 


g^g 


gag 


ctg 


ggc 


gtc 


gcg 


96 


172 


Asp Glu Ala Val 


Ala 


Gly 


Ala 


Arg 


Lys 


Ala 


Glu 


Glu 


Leu 


Gly 


Val 


Ala 




173 


20 










25 










30 








175 


gcg tat tgc gta 


aat 


ccc 


ata 


tac 


gcc 


cct 


gtt 


gtt 


egg 


cct 


ttg 


ttg 


144 


176 


Ala Tyr Cys Val 


Asn 


Pro 


He 


Tyr 


Ala 


Pro 


Val 


Val 


Arg 


Pro 


Leu 


Leu 




177 


35 








40 










45 










179 


egg aaa gta aag 


etc 


tgc 


gta 


gtg 


gcg 


gac 


ttc 


ccc 


ttt 


ggg 


gcc 


ttg 


192 


180 


Arg Lys Val Lys 


Leu 


Cys 


Val 


Val 


Ala 


Asp 


Phe 


Pro 


Phe 


Gly 


Ala 


Leu 




181 


50 






55 










60 












183 


cca acg gcc age 


aga 


att 


gcc 


ttg 


gtt 


tct 


agg 


ctt 


get 


gaa 


gtg 


gca 


240 


184 


Pro Thr Ala Ser 


Arg 


He 


Ala 


Leu 


Val 


Ser 


Arg 


Leu 


Ala 


Glu 


Val 


Ala 




185 


65 




70 










75 










80 




187 


gat gag ata gac 


gtg 


gtg 


gcg 


cct 


ata 


ggc 


etc 


gtg 


aaa 


teg 


egg 


agg 


288 


188 


Asp Glu lie Asp 


Val 


Val 


Ala 


Pro 


He 


Gly 


Leu 


Val 


Lys 


Ser 


Arg 


Arg 




189 




85 










90 










95 






191 


tgg gcc gag gtg 


aga 


agg 


gac 


tta 


ata 


age 


gtt 


gtg 


ggt 


gee 


gca 


ggc 


336 


192 


Trp Ala Glu Val 


Arg 


Arg 


Asp 


Leu 


He 


Ser 


Val 


Val 


Gly 


Ala 


Ala 


Gly 




193 


100 










105 










110 








195 


ggg aga gtg gta 


aag 


gta 


ate 


aca 


gag 


gag 


cct 


tat 


eta 


agg 


gat 


gag 


384 


196 


Gly Arg Val Val 


Lys 


Val 


He 


Thr 


Glu 


Glu 


Pro 


Tyr 


Leu 


Arg 


Asp 


Glu 




197 


115 








120 










125 










199 


gag agg tat acg 


ctt 


tae 


gac 


att 


att 


gca 


gag 


get 


ggg 


gcc 


cac 


ttt 


432 
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200 


Glu Arg Tyr Thr Leu Tyr 


Asp 


He 


He 


Ala 


Glu 


Ala 


Gly 


Ala 


His 


Phe 


201 


130 








135 










140 










203 


ata aaa age 


tec 


aet 


gga 


tte 


gcc 


gaa 


gag 


gcc 


tac 


gcc 


gcc 


aga 


cag 


204 


lie Lys Ser 


Ser Thr Gly 


Phe 


Ala 


Glu 


Glu 


Ala 


Tyr 


Ala 


Ala 


Arg 


Gin 


205 


145 






150 










155 










160 


207 


gga aat cct 


gta 


cae 


tea 


acg 


ccg 


gag 


agg 


geg 


geg 


gca 


att 


gee 


cgc 


208 


Gly Asn Pro Val 


His 


Ser 


Thr 


Pro 


Glu 


Arg Ala 


Ala 


Ala 


He 


Ala 


Arg 


209 






165 










170 










175 




211 


tac ata aaa 


gag 


aag 


ggg 


tat 


aga 


ctg 


ggg 


gtg 


aaa 


atg 


geg 


ggg 


ggg 


212 


Tyr lie Lys Glu Lys Gly 


Tyr 


Arg 


Leu 


Gly Val 


Lys 


Met 


Ala 


Gly 


Gly 


213 




180 










185 










190 






215 


att agg aca 


agg 


gag 


cag 


gca 


aag 


gee 


att 


gtt 


gac 


gcc 


att 


gga 


tgg 


216 


lie Arg Thr Arg Glu Gin 


Ala 


Lys 


Ala 


He 


Val 


Asp 


Ala 


He 


Gly 


Trp 


217 


195 










200 










205 








219 


ggc gag gac 


cca 


gcc 


cgc 


gte 


agg 


ctg 


ggg 


acg 


tec 


ace 


cca 


gag 


get 


220 


Gly Glu Asp Pro Ala Arg 


Val 


Arg 


Leu 


Gly Thr 


Ser 


Thr 


Pro 


Glu 


Ala 


221 


210 








215 










220 










223 


ctt eta tag 




























224 


Leu Leu 




























225 


225 




























228 


<210> SEQ ID NO 


: 4 
























229 


<211> LENGTH: 226 
























230 


<212> TYPE: 


PRT 


























231 


<213> ORGANISM: 


Pyrobaculum 


aerophilum 














233 


<400> SEQUENCE: 


4 
























234 


Met He His 


Leu 


Val 


Asp 


Tyr 


Ala 


Leu 


Leu 


Lys 


Pro 


Tyr 


Leu 


Thr 


Val 


235 


1 




5 










10 










15 




237 


Asp Glu Ala 


Val 


Ala 


Gly 


Ala 


Arg 


Lys 


Ala 


Glu 


Glu 


Leu 


Gly 


Val 


Ala 


238 




20 










25 










30 






240 


Ala Tyr Cys 


Val 


Asn 


Pro 


He 


Tyr 


Ala 


Pro 


Val 


Val 


Arg 


Pro 


Leu 


Leu 


241 


35 










40 










45 








243 


Arg Lys Val 


Lys 


Leu 


Cys 


Val 


Val 


Ala 


Asp 


Phe 


Pro 


Phe 


Gly 


Ala 


Leu 


244 


50 








55 










60 










246 


Pro Thr Ala 


Ser 


Arg 


He 


Ala 


Leu 


Val 


Ser Arg 


Leu 


Ala 


Glu 


Val 


Ala 


247 


65 






70 










75 










80 


249 


Asp Glu He 


Asp 


Val 


Val 


Ala 


Pro 


He 


Gly Leu 


Val 


Lys 


Ser 


Arg 


Arg 


250 






85 










90 










95 




252 


Trp Ala Glu 


Val 


Arg 


Arg 


Asp 


Leu 


He 


Ser 


Val 


Val 


Gly 


Ala 


Ala 


Gly 


253 




100 










105 










110 






255 


Gly Arg Val 


Val 


Lys 


Val 


He 


Thr 


Glu 


Glu 


Pro 


Tyr 


Leu 


Arg 


Asp 


Glu 


256 


115 










120 










125 








258 


Glu Arg Tyr 


Thr 


Leu 


Tyr 


Asp 


He 


He 


Ala 


Glu 


Ala 


Gly 


Ala 


His 


Phe 


259 


130 








135 










140 










261 


He Lys Ser 


Ser 


Thr 


Gly 


Phe 


Ala 


Glu 


Glu 


Ala 


Tyr 


Ala 


Ala 


Arg 


Gin 


262 


145 






150 










155 










160 


264 


Gly Asn Pro 


Val 


His 


Ser 


Thr 


Pro 


Glu 


Arg 


Ala 


Ala 


Ala 


He 


Ala 


Arg 


265 






165 










170 










175 




267 


Tyr He Lys 


Glu 


Lys 


Gly 


Tyr 


Arg 


Leu 


Gly Val 


Lys 


Met 


Ala 


Gly 


Gly 


268 




180 










185 










190 







480 



528 



576 



624 



672 



681 
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270 He Arg Thr Arg Glu Gin Ala Lys Ala He Val Asp Ala He Gly Trp 

271 195 200 205 

273 Gly Glu Asp Pro Ala Arg Val Arg Leu Gly Thr Ser Thr Pro Glu Ala 

274 210 215 220 

276 Leu Leu 

277 225 

281 <210> SEQ ID NO: 5 

282 <211> LENGTH: 30 

283 <212> TYPE: DNA 

284 <213> ORGANISM: Artificial Sequence 

286 <220> FEATURE: 

287 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

288 Primer 

290 <400> SEQUENCE: 5 

291 tatatcatat gatagagtac aggattgagg 30 

294 <210> SEQ ID NO: 6 

295 <211> LENGTH: 30 

296 <212> TYPE: DNA 

297 <213> ORGANISM: Artificial Sequence 

299 <220> FEATURE: 

300 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

301 Primer 

303 <400> SEQUENCE: 6 

304 taatggatcc tcaacctcca tatctctctt 30 

307 <210> SEQ ID NO: 7 

308 <211> LENGTH: 30 

309 <212> TYPE: DNA 

310 <213> ORGANISM: Artificial Sequence 

312 <220> FEATURE: 

313 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

314 Primer 

316 <400> SEQUENCE: 7 

317 tatatcatat gatacattta gtagactacg 30 

320 <210> SEQ ID NO: 8 

321 <211> LENGTH: 30 

322 <212> TYPE: DNA 

323 <213> ORGANISM: Artificial Sequence 

325 <220> FEATURE: 

326 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

32 7 Primer 

329 <400> SEQUENCE: 8 

330 taatggatcc ctatagaaga gcctctgggg 30 

333 <210> SEQ ID NO: 9 

334 <211> LENGTH: 30 

335 <212> TYPE: DNA 

336 <213> ORGANISM: Artificial Sequence 

338 <220> FEATURE: 

339 <223> OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

340 Primer 
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